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The technology underlying text search engines has advanced dramatically in the past 
decade. The development of a family of new index representations has led to a wide 
range of innovations in index storage, index construction, and query evaluation. While 
some of these developments have been consolidated in textbooks, many specific 
techniques are not widely known or the textbook descriptions are out of date. In this 
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Endeca's approach to processing search queries in a distributed computing environment is 
predicated on concerns of correctness, scalability, and flexibility in deployment. Using a 
master-slave architecture, we are able to support classic search as well as more advanced 
features. We avoid bottlenecking the master with excessive computation or 
communication by limiting the information from the slaves in the expected case. 
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The technology underlying text search engines has advanced dramatically in the past 
decade. The development of a family of new index representations has led to a wide 
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Geographic web search engines allow users to constrain and order search results in an 
intuitive manner by focusing a query on a particular geographic region. Geographic search 
technology, also called local search, has recently received significant interest from major 
search engine companies. Academic research in this area has focused primarily on 
techniques for extracting geographic knowledge from the web. In this paper, we study the 
problem of efficient query processing in scalable geogr ... 

Shape-based retrieval and analysis of 3D models 
Thomas Funkhouser, Michael Kazhdan 

August 2004 ACM SIGGRAPH 2004 Course Notes SIGGRAPH '04 
Publisher: ACM Press 

Full text available: *p ?| pdf(12.56 MB) Additional Information: full citation , abstract 



http://portal.acm.org/results.cfm?coll=ACM&(U=ACM&CFro=2n 



6/12/2007 



Results (page 1): partition and range and rank and "search engine" and query and index Page 1 of 7 




USPTO 



Subscribe (Full Service) Register (Limited Service, Free) Login 

Search: ® The ACM Digital Library O The Guide 
jpartition and range and rank and ''search engine ' and query ar 



: .-,*.v:*:«&«£«3 



Terms used partition and range and rank and search 
engine and query and index 



Sort results j re j evance i^f ^ Save results to a Binder 

by I fwi 

^ Search Tips 

□ Open results in a new 
window 



Display 
results 



j expanded form ^| 



Feedback Report a problem Satisfaction 
survey 

Found 38,068 of 203,282 

Try an Advanced Search 

Try this search in The ACM Guide 



Results 1 - 20 of 200 
Best 200 shown 

1 



Result page: 12345678910 next 

Relevance scale □ Q 



Inverted files for text search engines 
Justin Zobel, Alistair Moffat 

July 2006 ACM Computing Surveys (CSUR), volume 38 issue 2 
Publisher: ACM Press 

Full text available: |f | pdf(944.29 KB) Additional Information: full citation , abstract , references , index terms 

The technology underlying text search engines has advanced dramatically in the past 
decade. The development of a family of new index representations has led to a wide 
range of innovations in index storage, index construction, and query evaluation. While 
some of these developments have been consolidated in textbooks, many specific 
techniques are not widely known or the textbook descriptions are out of date. In this 
tutorial, we introduce the key techniques in the area, describing both a core impl ... 

Keywords: Inverted file indexing, Web search engine, document database, information 
retrieval, text retrieval 



2 Querying and web: Efficient query processing in geographic web search engines 
#> Yen_Yu Chen, Torsten Suel, Alexander Markowetz 

>r June 2006 Proceedings of the 2006 ACM SIGMOD international conference on 
Management of data SIGMOD "06 

Publisher: ACM Press 

Full text available: ^ pdf(296.76 KB) Additional Information: full citation , abstract , references , index terms 

Geographic web search engines allow users to constrain and order search results in an 
intuitive manner by focusing a query on a particular geographic region. Geographic search 
technology, also called local search, has recently received significant interest from major 
search engine companies. Academic research in this area has focused primarily on 
techniques for extracting geographic knowledge from the web. In this paper, we study the 
problem of efficient query processing in scalable geogr ... 




Shape-based retrieval and analysis of 3D models 
Thomas Funkhouser, Michael Kazhdan 

August 2004 ACM SIGGRAPH 2004 Course Notes SIGGRAPH '04 
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Full text available: fjl pdf(12.56 MB) Additional Information: full citation , abstract 
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The technology underlying text search engines has advanced dramatically in the past 
decade. The development of a family of new index representations has led to a wide 
range of innovations in index storage, index construction, and query evaluation. While 
some of these developments have been consolidated in textbooks, many specific 
techniques are not widely known or the textbook descriptions are out of date. In this 
tutorial, we introduce the key techniques in the area, describing both a core impl ... 

Keywords: Inverted file indexing, Web search engine, document database, information 
retrieval, text retrieval 
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Publisher: ACM Press 

Full text available: ||] pdf(296.76 KB) Additional Information: full citation , abstract , references , index terms 

Geographic web search engines allow users to constrain and order search results in an 
intuitive manner by focusing a query on a particular geographic region. Geographic search 
technology, also called local search, has recently received significant interest from major 
search engine companies. Academic research in this area has focused primarily on 
techniques for extracting geographic knowledge from the web. In this paper, we study the 
problem of efficient query processing in scalable geogr ... 
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1 Analysis of the generalized clock buffer replacement scheme for database 
A transaction processing 
^ Victor F. Nicola, Asit Dan, Daniel M. Dias 

June 1992 ACM SIGMETRICS Performance Evaluation Review , Proceedings of the 
1992 ACM SIGMETRICS joint international conference on Measurement 
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Volume 20 Issue 1 

Publisher: ACM Press 

Additional Information: full citation , abstract , references , citings , index 
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Full text available: || | pdf(1.22 MB) ' 



The CLOCK algorithm is a popular buffer replacement algorithm because of its simplicity 
and its ability to approximate the performance of the Least Recently Used (LRU) 
replacement policy. The Generalized Clock (GCLOCK) buffer replacement policy uses a 
circular buffer and a weight associated with each page brought in buffer to decide on 
which page to replace. We develop an approximate analysis for the GCLOCK policy under 
the Independent Reference Model (IRM) that applies to many database tra ... 

Searching in high-dimensional spaces: Index structures for improving the 

performance of multimedia databases 

Christian Bohm, Stefan Berchtold, Daniel A. Keim 

September 2001 ACM Computing Surveys (CSUR), volume 33 issue 3 

Publisher: ACM Press 

Additional Information: full citation , abstract , references , citings , index 
terms 



Full text available: ^ pdf(1.39 MB) 



During the last decade, multimedia databases have become increasingly important in 
many application areas such as medicine, CAD, geography, and molecular biology. An 
important research issue in the field of multimedia databases is the content-based 
retrieval of similar multimedia objects such as images, text, and videos. However, in 
contrast to searching data in a relational database, a content-based retrieval requires the 
search of similar objects as a basic functionality of the database system ... 

Keywords: Index structures, indexing high-dimensional data, multimedia databases, 
similarity search 
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Publisher: ACM Press 

Additional Information: full citation , abstract , references , citings , index 
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During the last decade, multimedia databases have become increasingly important in 
many application areas such as medicine, CAD, geography, and molecular biology. An 
important research issue in the field of multimedia databases is the content-based 
retrieval of similar multimedia objects such as images, text, and videos. However, in 
contrast to searching data in a relational database, a content-based retrieval requires the 
search of similar objects as a basic functionality of the database system ... 

Keywords: Index structures, indexing high-dimensional data, multimedia databases, 
similarity search 



2 Optimizing result prefetching in web search engines with segmented indices 

Ronny Lempel, Shlomo Moran 
^ February 2004 ACM Transactions on Internet Technology (TOIT), Volume 4 issue l 
Publisher: ACM Press 

Full text available* f^l pdf(183 97 KB) Additional Information: full citation , abstract , references , citings , index 

! terms 

We study the process in which search engines with segmented indices serve queries. In 
particular, we investigate the number of result pages that search engines should prepare 
during the query processing phase. Search engine users have been observed to browse 
through very few pages of results for queries that they submit. This behavior of users 
suggests that prefetching many results upon processing an initial query is not efficient, 
since most of the prefetched results will not be requested by the ... 

Keywords: Distributed inverted indices, prefetching, search engines 
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1 Evaluation: Four scorers and seven years ago: the scoring method for MUC-6 
Nancy Chinchor 

November 1993 Proceedings of the 6th conference on Message understanding MUC6 
95 

Publisher: Association for Computational Linguistics 

Full text available: ^| pdf(405.37 KB) Additional Information: full citation , abstract , references , citings 

The MUC-6 scoring method is based on a two-step process of mapping an item generated 
by a system under evaluation (the "response") to the corresponding item in the human- 
generated answer key and then scoring the mapped items. The resulting scores are used 
for decision-making over the entire evaluation cycle, including refinement of the task 
definition based on interannotator comparisons, technology development using training 
data, validating answer keys, and benchmarking both system and human ca ... 



2 Machine learning for coreference resolution: from local classification to global ranking B 
Vincent Ng 

June 2005 Proceedings of the 43rd Annual Meeting on Association for Computational 

Linguistics ACL '05 
Publisher: Association for Computational Linguistics 

Full text available: ||) pdf(1 16.60 KB) Additional Information: full citation , abstract , references 

In this paper, we view coreference resolution as a problem of ranking candidate partitions 
generated by different coreference systems. We propose a set of partition-based features 
to learn a ranking model for distinguishing good and bad partitions. Our approach 
compares favorably to two state-of-the-art coreference systems when evaluated on three 
standard coreference data sets. 



3 Large scale gymnastics championships: An on-line interactive collection and analysis H 
|k of scores 

Daniel L. Bernitt, James S. Walton, Frederick S. Bader 

June 1975 Proceedings of seventh international conference on APL APL '75 
Publisher: ACM Press 

Full text available: ^ pdf(629.24 KB) Additional Information: full citation , abstract , references , index terms 

APL has proved to be a magnificent and simple tool for the collection and analysis of the 
rather complicated scoring in large-scale regional, national or international gymnastics 
championships, a task previously done by teams of human scorers. 
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Full text available: i p 1 |j pdf(405.37 KB) Additional Information: full citation , abstract , references , citings 

The MUC-6 scoring method is based on a two-step process of mapping an item generated 
by a system under evaluation (the "response") to the corresponding item in the human- 
generated answer key and then scoring the mapped items. The resulting scores are used 
for decision-making over the entire evaluation cycle, including refinement of the task 
definition based on interannotator comparisons, technology development using training 
data, validating answer keys, and benchmarking both system and human ca ... 



Machine learning for coreference resolution: from local classification to global ranking Q 
Vincent Ng 

June 2005 Proceedings of the 43rd Annual Meeting on Association for Computational 

Linguistics ACL '05 
Publisher: Association for Computational Linguistics 

Full text available: " H pdff 116.60 KB) Additional Information: full citation , abstract , references 

In this paper, we view coreference resolution as a problem of ranking candidate partitions 
generated by different coreference systems. We propose a set of partition-based features 
to learn a ranking model for distinguishing good and bad partitions. Our approach 
compares favorably to two state-of-the-art coreference systems when evaluated on three 
standard coreference data sets. 
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|k of scores 

Daniel L. Bernitt, James S. Walton, Frederick S. Bader 

June 1975 Proceedings of seventh international conference on APL APL '75 
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Full text available: * g[ pdf(629.24 KB) Additional Information: full citation , abstract , references , index terms 

APL has proved to be a magnificent and simple tool for the collection and analysis of the 
rather complicated scoring in large-scale regional, national or international gymnastics 
championships, a task previously done by teams of human scorers. 
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4 Answering Clinical Questions with Knowledge-Based and Statistical Techniques m 
Dina Demner-Fushman, Jimmy Lin 

March 2007 Computational Linguistics, volume 33 issue l 

Publisher: MIT Press 

Full text available: pdf(295.45 KB) Additional Information: full citation , abstract 

The combination of recent developments in question-answering research and the 
availability of unparalleled resources developed specifically for automatic semantic 
processing of text in the medical domain provides a unique opportunity to explore 
complex question answering in the domain of clinical medicine. This article presents a 
system designed to satisfy the information needs of physicians practicing evidence-based 
medicine. We have developed a series of knowledge extractors, which employ a ... 

5 Optimizing result prefetching in web search engines with segmented indices Q 
Ronny Lempel, Shlomo Moran 

February 2004 ACM Transactions on Internet Technology (TOIT), volume 4 issue l 
Publisher: ACM Press 



Full text available- f fl pdf(183 97 KB) Additiona ' Information: full citation , abstract , references , citings , index 

~ terms 

We study the process in which search engines with segmented indices serve queries. In 
particular, we investigate the number of result pages that search engines should prepare 
during the query processing phase. Search engine users have been observed to browse 
through very few pages of results for queries that they submit. This behavior of users 
suggests that prefetching many results upon processing an initial query is not efficient, 
since most of the prefetched results will not be requested by the ... 

Keywords: Distributed inverted indices, prefetching, search engines 



6 Music: Approximate matching algorithms for music information retrieval using vocal H 

i> input 

Richard L. Kline, Ephraim P. Glinert 

November 2003 Proceedings of the eleventh ACM international conference on 
Multimedia MULTIMEDIA '03 

Publisher: ACM Press 

Full text available: Wj pdf(165.02 KB) Additional Information: full citation , abstract , references , citings, index 

terms 

Effective use of multimedia collections requires efficient and intuitive methods of 
searching and browsing. This work considers databases which store music and explores 
how these may best be searched by providing input queries in some musical form. For the 
average person, humming several notes of the desired melody is the most straightforward 
method for providing this input, but such input is very likely to contain several errors. 
Previously proposed implementations of so-called query-by-humm ... 

Keywords: music information retrieval, query by humming 



7 LOF: identifying density-based local outliers 
<% Markus M. Breunig, Hans-Peter Kriegel, Raymond T. Ng, Jorg Sander 

X May 2000 ACM SIGMOD Record , Proceedings of the 2000 ACM SIGMOD international 
conference on Management of data SIGMOD '00, volume 29 issue 2 
Publisher: ACM Press 

Full text available: pdf(263.81 KB) Additional Information: full citation , abstract , references , citings , index 
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For many KDD applications, such as detecting criminal activities in E-commerce, finding 
the rare instances or the outliers, can be more interesting than finding the common 
patterns. Existing work in outlier detection regards being an outlier as a binary property. 
In this paper, we contend that for many scenarios, it is more meaningful to assign to each 
object a degree of being an outlier. This degree is called the local outlier factor (LOF) of 
an object. It is local in th ... 

Keywords: database mining, outlier detection 



8 Evaluation: Overview of results of the MUC-6 evaluation B 
Beth M. Sundheim 

November 1993 Proceedings of the 6th conference on Message understanding MUC6 
•95 

Publisher: Association for Computational Linguistics 

Full text available: l | | pdf(1.56 MB) Additional Information: full citation , abstract , references , citings 

The latest in a series of natural language processing system evaluations was concluded in 
October 1995 and was the topic of the Sixth Message Understanding Conference (MUC-6) 
in November. Participants were invited to enter their systems in as many as four different 
task-oriented evaluations. The Named Entity and Coreference tasks entailed Standard 
Generalized Markup Language (SGML) annotation of texts and were being conducted for 
the first time. The other two tasks, Template Element and Scenario ... 

9 Perspectives on testing for programming aptitude H 
Jack M. Wolfe 

^ January 1971 Proceedings of the 1971 26th annual conference 
Publisher: ACM Press 

Full text available* *pi pdf(918 38 KB) Addjtjonal Information: full citation , abstract , references , citings , index 

! terms 

This paper discusses limitations of programming aptitude tests. The use of multiple-choice 
type questions, the testwiseness of the college graduate group, and the inclusion of 
questions of mathematical information tend to diminish the effectiveness of such tests as 
predictors of success in business programming. Timed tests favor the faster worker over 
the slower one who is often more accurate and of equal or better logical capability. The 
usual criterion of a test's validity is itself often ... 

Keywords: Aptitude, Aptitude tests, Criterion, Evaluation, Multiple choice questions, 
Occupational groups, Programming aptitude, Reliability, Tests, Timed tests, Validity 

10 Objective quality ranking of computing journals Q 
<ft> Pairin Katerattanakul, Bernard Han, Soongoo Hong 

▼ October 2003 Communications of the ACM, volume 46 issue 10 
Publisher: ACM Press 

Full text available: < |g) pdf(84.99 KB) Additional Information: full citation , abstract , references , citings , index 
Nfl html( 18.23 KB) terms 

Citation analysis, which provides a clear picture of actual use of journals and their articles, 
is an effective way to determine a journal's influence. 

11 Selection of computer personnel: Selection of systems analysts and programmer 
trainees 



Raymond M. Berger 
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June 1968 Proceedings of the Sixth SIGCPR conference on Computer personnel 

research SIGCPR '68 
Publisher: ACM Press 

Full text available: |p pdf(861.44 KB) Additional Information: full citation , abstract , citings 

The subject of this talk is primarily the selection of Systems Analysts. Time permitting, I 
shall also discuss work done on the selection of programmer trainees. The research 
program, as many of you have heard described in previous talks, was divided into three 
phases. The first was the job analysis phase in which 17 job dimensions were revealed to 
cover systems analyst and programmer positions. The second phase dealt with the 
development of criterion tests for evaluating programmer and systems ... 

12 The need for accurate alignment in natural language system evaluation 
Andrew Kehler, John Bear, Douglas Appelt 

June 2001 Computational Linguistics, volume 27 issue 2 

Publisher: MIT Press 

Full text available: ^ ffjj] 

tgj pqf(l.37 MB) ^ Additional Information: full citation , abstract , references 
Publisher Site 

As evaluations of computational linguistics technology progress toward higher-level 
interpretation tasks, the problem of determining alignments between system responses 
and answer key entries may become less straightforward. We present an extensive 
analysis of the alignment procedure used in the MUC-6 evaluation of information 
extraction technology, which reveals effects that interfere with the stated goals of the 
evaluation. These effects are shown to be pervasive enough that they have the pote ... 

13 Eve gaze and multimodal integration patterns: Individual differences in multimodal 
integration patterns: what are they and why do they exist? 
Sharon Oviatt, Rebecca Lunsford, Rachel Coulston 

April 2005 Proceedings of the SIGCHI conference on Human factors in computing 

systems CHI '05 
Publisher: ACM Press 

Full text available 'pI pdfOQS 16 KB) Additional Information: full citation , abstract , references , citings , index 
^ terms 

Techniques for information fusion are at the heart of multimodal system design. To 
develop new user-adaptive approaches for multimodal fusion, the present research 
investigated the stability and underlying cause of major individual differences that have 
been documented between users in their multimodal integration pattern. Longitudinal 
data were collected from 25 adults as they interacted with a map system over six weeks. 
Analyses of 1,100 multimodal constructions revealed that everyone had a do ... 

Keywords: commands, conversations, disfluencies, errors, impulsive-reflective cognitive 
style, individual differences, multimodal integration patterns, simultaneous or sequential 
input 



14 SouthamptonTAC: An adaptive autonomous trading agent 
Minghua He, Nicholas R. Jennings 

August 2003 ACM Transactions on Internet Technology (TOIT), volume 3 issue 3 
Publisher: ACM Press 

Full text available" W pdf(571 24 KB) Additional Information: full citation , abstract , references , citings , index 
^ ' terms 

Software agents are increasingly being used to represent humans in on-line auctions. 
Such agents have the advantages of being able to systematically monitor a wide variety 
of auctions and then make rapid decisions about what bids to place in what auctions. They 
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can do this continuously and repetitively without losing concentration. Moreover, in 
complex multiple auction settings, agents may need to modify their behavior in one 
auction depending on what is happening in another. To provide a means ... 

Keywords: On-line auctions, bidding strategy, trading agent competition 



15 Information extraction: Comparing human and machine performance for natural 

language information extraction: results from the Tipster text evaluation 
Craig A. Will 

September 1993 Proceedings of a workshop on held at Fredericksburg, Virginia: 

September 19-23, 1993 
Publisher: Association for Computational Linguistics 

Full text available: || ] pdf(1.44 MB) Additional Information: fuil citation , abstract , references 

This paper presents results from a study comparing human performance on the text of 
natural language information extraction with that of machine extraction systems that 
were developed as part of the ARPA Tipster program. Information extraction is shown to 
be a difficult task for both humans and machines. Evidence for one set of text material, 
English Microelectronics, indicated that a human analyst produces about half the errors as 
does machine systems. 



16 The reliability of analytic and holistic methods in rating students' computer programs 
David M. Olson 

February 1988 ACM SIGCSE Bulletin , Proceedings of the nineteenth SIGCSE technical 
symposium on Computer science education SIGCSE '88, Volume 20 issue 1 
Publisher: ACM Press 

Full text available: ^|pdf(635.37 KB) Additional Information: full citation , abstract, references , citings, index 

terms 

The study examined analytic and holistic methods of scoring students's programs. The 
sample included 30 students in an introductory computer science class. The inter-rater 
reliabilities for both scoring methods was significant (analytic, r = .66; holistic, r = .73). 
The correlation between students' holistic and analytic scores on the programs was 
moderate (r = .66), indicating that the two scoring methods measure different attributes 
of a student's program. 



17 Retrieval effectiveness of an ontology-based model for information selection 
Latifur Khan, Dennis McLeod, Eduard Hovy 

January 2004 The VLDB Journal — The International Journal on Very Large Data 

Bases, Volume 13 Issue 1 

Publisher: Springer- Verlag New York, Inc. 

Full text available: pdf(278.74 KB) Additional Information: full citation , abstract , citings , index terms , review 

Technology in the field of digital media generates huge amounts of nontextual 
information, audio, video, and images, along with more familiar textual information. The 
potential for exchange and retrieval of information is vast and daunting. The key problem 
in achieving efficient and user-friendly retrieval is the development of a search 
mechanism to guarantee delivery of minimal irrelevant information (high precision) while 
insuring relevant information is not overlooked (high recall). The tradi ... 

Keywords: Audio, Metadata, Ontology, Precision, Recall, SQL 
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SIGIR conference on Information storage and retrieval: theoretical issues 
in information retrieval SIGIR '81, volume ie issue l 
Publisher: ACM Press 

Full text available: ^E j pdf(570.54 KB) Additional Information: full citation , abstract , references 

This research examined the relationship between subjects' knowledge of an area and their 
similarity ratings of pairs of concepts. Knowledge was operationalized as (1) test score 
performance, and (2) correspondence with an experts similarity judgments. Subjects 
(n=55) were exposed to a three -week module on social psychology as part of their 
introductory psychology course. At the end of this course segment they were 
administered an examination and, later, a term similarity questionnaire. Students ... 

19 Training a selection function for extraction 
Chin-Yew Lin 

November 1999 Proceedings of the eighth international conference on Information 
and knowledge management CIKM '99 

Publisher: ACM Press 

Full text available' fS) pdf(2.38 MB) Additional Information: full citation , abstract , references , citings, index 
^ terms 

In this paper we compare performance of several heuristics in generating informative 
generic/query-oriented extracts for newspaper articles in order to learn how topic 
prominence affects the performance of each heuristic. We study how different query types 
can affect the performance of each heuristic and discuss the possibility of using machine 
learning algorithms to automatically learn good combination functions to combine several 
heuristics. We also briefly describe the design, implementa ... 

Keywords: automated text summarization, summary evaluation, topic extraction 
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We describe Web-a-Where, a system for associating geography with Web pages. Web-a- 
Where locates mentions of places and determines the place each name refers to. In 
addition, it assigns to each page a geographic focus — a locality that the page discusses 
as a whole. The tagging process is simple and fast, aimed to be applied to large 
collections of Web pages and to facilitate a variety of location-based applications and data 
analyses. Geotagging involves arbitrating two types of ambiguiti ... 

Keywords: disambiguation, gazetteer, geographic tagging, information retrieval, natural 
language processing, text mining 
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November 1993 Proceedings of the 6th conference on Message understanding MUC6 
95 

Publisher: Association for Computational Linguistics 

Full text available: pdf(405.37 KB) Additional Information: full citation , abstract , references , citings 

The MUC-6 scoring method is based on a two-step process of mapping an item generated 
by a system under evaluation (the "response") to the corresponding item in the human- 
generated answer key and then scoring the mapped items. The resulting scores are used 
for decision-making over the entire evaluation cycle, including refinement of the task 
definition based on interannotator comparisons, technology development using training 
data, validating answer keys, and benchmarking both system and human ca ... 
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June 2005 Proceedings of the 43rd Annual Meeting on Association for Computational 
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Full text available: ^ pdf(116.60 KB) Additional Information: full citation , abstract , references 

In this paper, we view coreference resolution as a problem of ranking candidate partitions 
generated by different coreference systems. We propose a set of partition-based features 
to learn a ranking model for distinguishing good and bad partitions. Our approach 
compares favorably to two state-of-the-art coreference systems when evaluated on three 
standard coreference data sets. 
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APL has proved to be a magnificent and simple tool for the collection and analysis of the 
rather complicated scoring in large-scale regional, national or international gymnastics 
championships, a task previously done by teams of human scorers. 
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